The spread of COVID-19 revealed that transmission risk patterns are not homogeneous across different cities and communities, and that various heterogeneous features influence the spread trajectories. Hence, for predictive pandemic monitoring, it is essential to explore the latent heterogeneous features of cities and communities that distinguish their specific pandemic spread trajectories. To this end, this study creates a network embedding model that captures cross-county visitation networks as well as heterogeneous features, in order to uncover clusters of counties in the United States based on their pandemic spread trajectories. First, we collected location intelligence features for 2,787 counties from March 3 through June 29, 2020 (the initial wave). Second, we constructed a human visitation network that incorporates county features as node attributes and visits between counties as network edges. Our attributed network embedding approach integrates both the topological characteristics of the cross-county visitation network and the heterogeneous features. We conducted clustering analysis on the attributed network embeddings to reveal four archetypes of spread risk trajectories corresponding to four clusters of counties. Subsequently, we identified four features as important in explaining the distinctive transmission risk patterns among the archetypes. The attributed network embedding approach and the findings identify and explain the non-homogeneous pandemic risk trajectories across counties for predictive pandemic monitoring. This study also shows how data-driven and deep learning-based approaches to pandemic analytics can complement standard epidemiological models for pandemic policy analysis.
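The abstract does not spell out the embedding architecture, so the following is only a minimal sketch of the general idea, with hypothetical inputs (`visits`, `county_features`): node attributes are propagated over the visitation network as a crude stand-in for a learned attributed network embedding, and the resulting vectors are clustered into four archetypes with k-means.

```python
import numpy as np
from sklearn.cluster import KMeans

# Hypothetical inputs: a dense visit matrix between n counties and a
# feature matrix of heterogeneous county attributes (one row per county).
rng = np.random.default_rng(0)
n_counties, n_features = 100, 8
visits = rng.poisson(2.0, size=(n_counties, n_counties)).astype(float)
county_features = rng.normal(size=(n_counties, n_features))

# Row-normalize the visitation network so each county's neighbors sum to 1.
row_sums = visits.sum(axis=1, keepdims=True)
row_sums[row_sums == 0] = 1.0
P = visits / row_sums

# Simple attributed embedding: concatenate the raw attributes with
# attributes propagated over 1- and 2-hop visitation neighborhoods
# (a stand-in for a learned attributed network embedding).
one_hop = P @ county_features
two_hop = P @ one_hop
embedding = np.hstack([county_features, one_hop, two_hop])

# Cluster the embeddings into four archetypes, as in the study.
clusters = KMeans(n_clusters=4, n_init=10, random_state=0).fit_predict(embedding)
print(np.bincount(clusters))
```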
This paper presents DAHiTra, a novel deep-learning model with hierarchical transformers that classifies building damage from satellite imagery in the aftermath of hurricanes. Automated building damage assessment provides critical information for decision making and resource allocation in rapid emergency response. Satellite imagery provides real-time, high-coverage information and offers an opportunity to inform large-scale post-disaster building damage assessment. In addition, deep learning methods have shown promise in classifying building damage. In this work, a novel transformer-based network is proposed for assessing building damage. The network leverages hierarchical spatial features at multiple resolutions and captures the temporal differences in the feature domain after applying a transformer encoder to the spatial features. The network achieves state-of-the-art performance when tested on a large-scale disaster damage dataset (xBD) for building localization and damage classification, as well as on the LEVIR-CD dataset for the change detection task. In addition, we introduce a new high-resolution satellite imagery dataset, Ida-BD (relating to Hurricane Ida in Louisiana in 2021), for domain adaptation, to further evaluate the capability of the model to be applied to newly damaged areas. The domain adaptation results indicate that the proposed model can be adapted to a new event with only limited fine-tuning. Hence, the proposed model advances the current state of the art through better performance and domain adaptation. Furthermore, Ida-BD provides a high-resolution annotated dataset for future studies in this domain.
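The exact DAHiTra architecture is not given in the abstract; as a rough illustration of the core idea (spatial tokens passed through a transformer encoder, then a temporal feature difference between pre- and post-event images), here is a toy PyTorch module. All layer sizes and names are invented for the sketch.

```python
import torch
import torch.nn as nn

class DamageClassifierSketch(nn.Module):
    """Toy stand-in for a hierarchical-transformer damage classifier:
    a small CNN extracts spatial features from pre- and post-event
    images, a transformer encoder contextualizes the spatial tokens,
    and the temporal difference of the encoded features is classified."""
    def __init__(self, n_classes=4, dim=64):
        super().__init__()
        self.backbone = nn.Sequential(           # coarse spatial features
            nn.Conv2d(3, dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(dim, dim, 3, stride=2, padding=1), nn.ReLU(),
        )
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(dim, n_classes)

    def encode(self, img):
        f = self.backbone(img)                   # (B, C, H, W)
        tokens = f.flatten(2).transpose(1, 2)    # (B, H*W, C) spatial tokens
        return self.encoder(tokens)

    def forward(self, pre, post):
        diff = self.encode(post) - self.encode(pre)  # temporal feature difference
        return self.head(diff.mean(dim=1))           # pooled difference -> classes

model = DamageClassifierSketch()
pre, post = torch.randn(2, 3, 64, 64), torch.randn(2, 3, 64, 64)
print(model(pre, post).shape)  # torch.Size([2, 4])
```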
The objective of this study is to develop and test a novel structured deep-learning modeling framework for urban flood nowcasting by integrating physics-based and human-sensed features. We present a new computational modeling framework consisting of an attention-based spatial-temporal graph convolution network (ASTGCN) model and different streams of data collected in real time and fed into the model, to account for the spatial and temporal information and dependencies that improve flood nowcasting. The novelty of the computational modeling framework is threefold: first, the model is capable of considering spatial and temporal dependencies in inundation propagation thanks to its spatial and temporal graph convolutional modules; second, it enables capturing the influence of heterogeneous temporal data streams that can signal flooding status, including physics-based features, such as rainfall intensity and water elevation, and human-sensed data, such as flood reports and fluctuations of human activity; third, its attention mechanism enables the model to direct its focus to the most influential features. We demonstrate the modeling framework in the context of Harris County, Texas, as the case study, with Hurricane Harvey as the flooding event. The results indicate that the model provides superior performance for nowcasting urban flood inundation at the census-tract level, with a precision of 0.808 and a recall of 0.891, outperforming several other novel models. Moreover, the performance of the ASTGCN model improved when heterogeneous dynamic features were added, compared with relying on physics-based features alone, which demonstrates the promise of using heterogeneous human-sensed data for flood nowcasting.
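For readers unfamiliar with attention-based spatial-temporal graph convolution, here is a minimal, assumption-laden PyTorch sketch of one such block (not the paper's ASTGCN implementation): attention rescales the adjacency before neighbor aggregation, and a 1-D convolution mixes the time axis.

```python
import torch
import torch.nn as nn

class ASTGCNBlockSketch(nn.Module):
    """Minimal sketch of one attention-based spatial-temporal graph
    convolution block. Input x: (batch, time, nodes, features);
    adj: (nodes, nodes) spatial adjacency."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.attn = nn.Linear(in_dim, 1)            # per-node attention score
        self.gconv = nn.Linear(in_dim, out_dim)     # feature transform after mixing
        self.tconv = nn.Conv1d(out_dim, out_dim, kernel_size=3, padding=1)

    def forward(self, x, adj):
        b, t, n, f = x.shape
        scores = self.attn(x).squeeze(-1)           # (b, t, n)
        weights = torch.softmax(scores, dim=-1)     # spatial attention per time step
        a = adj.unsqueeze(0).unsqueeze(0) * weights.unsqueeze(2)  # reweighted adjacency
        mixed = torch.einsum('btmn,btnf->btmf', a, x)             # neighbor aggregation
        h = torch.relu(self.gconv(mixed))           # (b, t, n, out_dim)
        h = h.permute(0, 2, 3, 1).reshape(b * n, -1, t)
        h = self.tconv(h)                           # temporal convolution
        return h.reshape(b, n, -1, t).permute(0, 3, 1, 2)

block = ASTGCNBlockSketch(in_dim=4, out_dim=8)
x = torch.randn(2, 6, 10, 4)   # e.g. rainfall, water level, reports, activity
adj = torch.rand(10, 10)       # hypothetical adjacency between census tracts
print(block(x, adj).shape)     # torch.Size([2, 6, 10, 8])
```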
Existing automated techniques for software documentation typically attempt to reason between two main sources of information: code and natural language. However, this reasoning process is often complicated by the lexical gap between more abstract natural language and more structured programming languages. One potential bridge for this gap is the Graphical User Interface (GUI), as GUIs inherently encode salient information about underlying program functionality into rich, pixel-based data representations. This paper offers one of the first comprehensive empirical investigations into the connection between GUIs and functional, natural language descriptions of software. First, we collect, analyze, and open source a large dataset of functional GUI descriptions consisting of 45,998 descriptions for 10,204 screenshots from popular Android applications. The descriptions were obtained from human labelers and underwent several quality control mechanisms. To gain insight into the representational potential of GUIs, we investigate the ability of four Neural Image Captioning models to predict natural language descriptions of varying granularity when provided a screenshot as input. We evaluate these models quantitatively, using common machine translation metrics, and qualitatively through a large-scale user study. Finally, we offer learned lessons and a discussion of the potential shown by multimodal models to enhance future techniques for automated software documentation.
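As a concrete, hypothetical illustration of the kind of neural image captioning model evaluated here (none of the paper's four architectures is reproduced), a bare-bones CNN encoder plus LSTM decoder in PyTorch might look like this:

```python
import torch
import torch.nn as nn

class CaptionerSketch(nn.Module):
    """Bare-bones neural image captioning sketch: a CNN encodes the
    screenshot into a single vector that initializes an LSTM decoder
    over caption tokens."""
    def __init__(self, vocab_size, embed_dim=128, hidden_dim=256):
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, hidden_dim),
        )
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, screenshot, tokens):
        h0 = self.cnn(screenshot).unsqueeze(0)    # (1, B, hidden) initial state
        c0 = torch.zeros_like(h0)
        seq, _ = self.lstm(self.embed(tokens), (h0, c0))
        return self.out(seq)                      # next-token logits per position

model = CaptionerSketch(vocab_size=1000)
logits = model(torch.randn(2, 3, 128, 128), torch.randint(0, 1000, (2, 12)))
print(logits.shape)  # torch.Size([2, 12, 1000])
```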
In this paper, we reduce the complexity of approximating the correlation clustering problem from $O(m\times\left( 2+ \alpha (G) \right)+n)$ to $O(m+n)$ for any given value of $\varepsilon$, for a complete signed graph with $n$ vertices and $m$ positive edges, where $\alpha(G)$ is the arboricity of the graph. Our approach gives the same output as the original algorithm and makes it possible to implement the algorithm in a fully dynamic setting, where edge sign flipping and vertex addition/removal are allowed. Constructing this index costs $O(m)$ memory and $O(m\times\alpha(G))$ time. We also study the structural properties of the non-agreement measure used in the approximation algorithm. The theoretical results are accompanied by a full set of experiments on seven real-world graphs. These results show the superiority of our index-based algorithm over the non-indexed one, with a 34% decrease in running time on average.
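The paper's index is not specified beyond its cost, so the following is a hedged Python sketch of the general idea: maintain each vertex's positive neighborhood under dynamic updates so that a non-agreement measure, taken here as the size of the symmetric difference of closed positive neighborhoods (an assumption, not necessarily the paper's exact definition), can be read off without rescanning the graph.

```python
from collections import defaultdict

class PositiveNeighborhoodIndex:
    """Sketch of an index over the positive edges of a complete signed
    graph, maintained under edge sign flips and vertex removal."""
    def __init__(self, positive_edges):
        self.nbrs = defaultdict(set)
        for u, v in positive_edges:
            self.nbrs[u].add(v)
            self.nbrs[v].add(u)

    def flip(self, u, v):
        """Flip the sign of edge (u, v)."""
        if v in self.nbrs[u]:
            self.nbrs[u].discard(v); self.nbrs[v].discard(u)
        else:
            self.nbrs[u].add(v); self.nbrs[v].add(u)

    def remove_vertex(self, u):
        for v in self.nbrs.pop(u, set()):
            self.nbrs[v].discard(u)

    def non_agreement(self, u, v):
        a, b = self.nbrs[u] | {u}, self.nbrs[v] | {v}
        return len(a ^ b)  # symmetric difference of closed neighborhoods

idx = PositiveNeighborhoodIndex([(1, 2), (2, 3), (1, 3), (3, 4)])
print(idx.non_agreement(1, 3))  # 1: the pair differs only on vertex 4
idx.flip(3, 4)                  # make edge (3, 4) negative
print(idx.non_agreement(1, 3))  # 0: now in full agreement
```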
This paper proposes a novel self-supervised Cut-and-Paste GAN that performs foreground object segmentation and generates realistic composite images without manual annotations. We accomplish this goal through a simple yet effective self-supervised approach coupled with a U-Net based discriminator. The proposed method extends the ability of standard discriminators to learn not only global data representations via classification (real/fake) but also semantic and structural information through pseudo labels created by the self-supervised task. The proposed method empowers the generator to create meaningful masks by forcing it to learn informative per-pixel as well as global image feedback from the discriminator. Our experiments demonstrate that the proposed method significantly outperforms state-of-the-art methods on standard benchmark datasets.
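A minimal PyTorch sketch of the cut-and-paste setup, with invented layer sizes and module names: a generator predicts a soft mask used to composite the foreground onto a background, and a discriminator returns both a global real/fake score and a per-pixel map, in the spirit of U-Net based discriminators.

```python
import torch
import torch.nn as nn

class MaskGeneratorSketch(nn.Module):
    """Predicts a soft foreground mask used to cut the object from a
    foreground image and paste it onto a background."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 1, 3, padding=1), nn.Sigmoid(),
        )

    def forward(self, fg):
        return self.net(fg)

class UNetStyleDiscriminatorSketch(nn.Module):
    """Discriminator with two outputs: a global real/fake score and a
    per-pixel decision map that can also be supervised with pseudo
    labels from the self-supervised task."""
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU())
        self.global_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 1))
        self.pixel_head = nn.ConvTranspose2d(16, 1, 4, stride=2, padding=1)

    def forward(self, img):
        f = self.enc(img)
        return self.global_head(f), self.pixel_head(f)

gen, disc = MaskGeneratorSketch(), UNetStyleDiscriminatorSketch()
fg, bg = torch.randn(2, 3, 64, 64), torch.randn(2, 3, 64, 64)
mask = gen(fg)
composite = mask * fg + (1 - mask) * bg   # cut-and-paste composition
global_score, pixel_map = disc(composite)
print(mask.shape, global_score.shape, pixel_map.shape)
```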
Machine learning models are typically evaluated by computing similarity with reference annotations and trained by maximizing similarity with such. Especially in the bio-medical domain, annotations are subjective and suffer from low inter- and intra-rater reliability. Since annotations only reflect the annotation entity's interpretation of the real world, this can lead to sub-optimal predictions even though the model achieves high similarity scores. Here, the theoretical concept of Peak Ground Truth (PGT) is introduced. PGT marks the point beyond which an increase in similarity with the reference annotation stops translating to better Real World Model Performance (RWMP). Additionally, a quantitative technique to approximate PGT by computing inter- and intra-rater reliability is proposed. Finally, three categories of PGT-aware strategies to evaluate and improve model performance are reviewed.
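As a small illustration (assuming segmentation masks and Dice similarity; the paper's quantitative technique may differ), PGT can be approximated by the mean pairwise inter-rater similarity:

```python
import numpy as np

def dice(a, b):
    """Dice similarity between two binary masks."""
    inter = np.logical_and(a, b).sum()
    denom = a.sum() + b.sum()
    return 2.0 * inter / denom if denom else 1.0

def peak_ground_truth(annotations):
    """Approximate PGT as the mean pairwise inter-rater similarity:
    once a model's similarity to one reference annotation exceeds the
    raters' agreement with each other, further gains are unlikely to
    reflect better real-world model performance (RWMP)."""
    sims = [dice(annotations[i], annotations[j])
            for i in range(len(annotations))
            for j in range(i + 1, len(annotations))]
    return float(np.mean(sims))

# Three hypothetical annotators labeling the same structure.
rng = np.random.default_rng(0)
base = rng.random((64, 64)) > 0.5
raters = [np.logical_xor(base, rng.random((64, 64)) > 0.9) for _ in range(3)]
print(f"estimated PGT (Dice): {peak_ground_truth(raters):.3f}")
```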
Finding and localizing conceptual changes between two images of the same scene captured at different times, in terms of the presence or removal of objects, is of great significance in special-care applications. This is mainly because the addition or removal of important objects in some environments can be harmful. As a result, there is a need for a program that locates these differences using machine vision. The most important challenges of this problem are changes in lighting conditions and the presence of shadows in the scene; the proposed methods must therefore be robust to them. In this article, a method based on deep convolutional neural networks with transfer learning is introduced, trained via an intelligent data-synthesis process. The results of this method are tested and reported on the dataset provided for this purpose. We show that the presented method is more efficient than other methods and can be used in a variety of real industrial environments.
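A hedged sketch of the transfer-learning idea, not the article's trained system: a pretrained backbone (here torchvision's ResNet-18, an assumed choice; weights are downloaded on first use) embeds both images of the scene, and a small head classifies the change from the feature difference.

```python
import torch
import torch.nn as nn
from torchvision import models

class ChangeDetectorSketch(nn.Module):
    """Siamese sketch of the transfer-learning idea: a pretrained CNN
    embeds the reference and current images of the scene, and a small
    head classifies whether an object was added or removed based on the
    feature difference, which helps tolerate lighting/shadow changes."""
    def __init__(self, n_classes=2):
        super().__init__()
        backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
        self.features = nn.Sequential(*list(backbone.children())[:-1])  # drop fc
        self.head = nn.Linear(512, n_classes)

    def forward(self, before, after):
        f1 = self.features(before).flatten(1)
        f2 = self.features(after).flatten(1)
        return self.head(f2 - f1)   # change direction from feature difference

model = ChangeDetectorSketch()
before, after = torch.randn(1, 3, 224, 224), torch.randn(1, 3, 224, 224)
print(model(before, after).shape)  # torch.Size([1, 2])
```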
Simulation-based falsification is a practical testing method to increase confidence that the system will meet safety requirements. Because full-fidelity simulations can be computationally demanding, we investigate the use of simulators with different levels of fidelity. As a first step, we express the overall safety specification in terms of environmental parameters and structure this safety specification as an optimization problem. We propose a multi-fidelity falsification framework using Bayesian optimization, which is able to determine at which level of fidelity we should conduct a safety evaluation in addition to finding possible instances from the environment that cause the system to fail. This method allows us to automatically switch between inexpensive, inaccurate information from a low-fidelity simulator and expensive, accurate information from a high-fidelity simulator in a cost-effective way. Our experiments on various environments in simulation demonstrate that multi-fidelity Bayesian optimization has falsification performance comparable to single-fidelity Bayesian optimization but with much lower cost.
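A toy, self-contained sketch of cost-aware multi-fidelity falsification with a Gaussian process surrogate. The simulators, fidelity rule, and costs below are all invented, and a faithful implementation would use a proper multi-fidelity GP rather than pooling observations from both simulators into one model.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

def low_fi(x):   # cheap, biased safety metric (toy low-fidelity simulator)
    return np.sin(3 * x) + 0.3 * x + 0.2

def high_fi(x):  # expensive, accurate safety metric; < 0 means failure
    return np.sin(3 * x) + 0.3 * x

COSTS = {"low": 1.0, "high": 10.0}
SIMS = {"low": low_fi, "high": high_fi}
X, y, fid = [], [], []

rng = np.random.default_rng(1)
for x0 in rng.uniform(0, 3, size=3):           # seed with cheap evaluations
    X.append([x0]); y.append(low_fi(x0)); fid.append("low")

for step in range(15):
    gp = GaussianProcessRegressor().fit(np.array(X), np.array(y))
    cand = np.linspace(0, 3, 200).reshape(-1, 1)
    mu, sigma = gp.predict(cand, return_std=True)
    i = np.argmin(mu - sigma)                  # most likely falsifying input
    x_next = cand[i]
    # Fidelity rule (a simple stand-in): verify promising, well-understood
    # candidates at high fidelity; explore uncertain regions cheaply.
    level = "high" if sigma[i] < 0.2 else "low"
    val = SIMS[level](x_next[0])
    X.append(list(x_next)); y.append(val); fid.append(level)
    if level == "high" and val < 0:
        print(f"falsified at x={x_next[0]:.3f} "
              f"(total cost {sum(COSTS[f] for f in fid):.0f})")
        break
```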
Ensemble learning combines the results of multiple machine learning models to provide a better, optimised predictive model with reduced bias and variance and improved predictions. However, in federated learning it is not feasible to apply centralised ensemble learning directly, due to privacy concerns. Hence, a mechanism is required to combine the results of local models into a global model. Most distributed consensus algorithms, such as Byzantine fault tolerance (BFT), do not normally perform well in such applications, because the predictions of some of the peers are disregarded: a majority of peers can win without other peers' decisions even being considered. Additionally, the confidence score of each peer's result is not normally taken into account, although it is an important feature to consider in ensemble learning. Moreover, the problem of tie events is often left unaddressed by methods such as BFT. To fill these research gaps, we propose PoSw (Proof of Swarm), a novel distributed consensus algorithm for ensemble learning in a federated setting, inspired by particle-swarm-based algorithms for solving optimisation problems. The proposed algorithm is theoretically proved to always converge in a relatively small number of steps and has mechanisms to resolve tie events while trying to achieve sub-optimum solutions. We experimentally validated the performance of the proposed algorithm using ECG classification as an example application in healthcare, showing that the ensemble learning model outperformed all local models and even the FL-based global model. To the best of our knowledge, the proposed algorithm is the first attempt to reach consensus over the output results of distributed models trained using federated learning.
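PoSw's swarm-inspired update rule is not reproduced here; the sketch below only illustrates, in plain Python, the two requirements the abstract emphasizes: confidence-weighted aggregation of every peer's output and deterministic tie resolution.

```python
from collections import defaultdict

def consensus(predictions):
    """Confidence-weighted vote over local model outputs (a simplified
    stand-in for PoSw, which iteratively moves peers toward the best
    candidate as in particle swarm optimisation). `predictions` maps
    peer id -> (predicted class, confidence in [0, 1])."""
    weight = defaultdict(float)
    for peer, (label, conf) in predictions.items():
        weight[label] += conf          # every peer contributes, scaled by confidence
    best = max(weight.values())
    winners = sorted(lbl for lbl, w in weight.items() if w == best)
    return winners[0]                  # deterministic tie-break: lowest label

peers = {
    "p1": ("AFib", 0.50),
    "p2": ("Normal", 0.25),
    "p3": ("AFib", 0.75),
    "p4": ("Normal", 1.00),
}
print(consensus(peers))  # AFib vs Normal tied at 1.25 -> tie broken to "AFib"
```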